Optical Character Recognition of Amharic Documents
Similar resources
Optical Character Recognition of Amharic Documents
Around 2,500 languages are spoken in Africa, some of which have their own indigenous scripts. Accordingly, a large body of printed documents is available in libraries, information centers, museums and offices. Digitizing these documents makes it possible to harness existing information technologies for local information needs and development. This paper presents an Optical Charact...
Integrating Optical Character Recognition and Machine Translation of Historical Documents
Machine Translation (MT) plays a critical role in expanding capacity in the translation industry. However, many valuable documents, including digital documents, are encoded in non-accessible formats for machine processing (e.g., Historical or Legal documents). Such documents must be passed through a process of Optical Character Recognition (OCR) to render the text suitable for MT. No matter how...
Amharic Character Recognition using a Fast Signature Based Algorithm
The Amharic language is the principal language of over 20 million people mainly in Ethiopia. An extensive literature survey reveals no journal or conference papers on Amharic character recognition. The Amharic script has 33 basic characters each with seven orders giving 231 distinct characters, not including numbers and punctuation symbols. The characters are cursive but not connected and unlik...
Optical Character Recognition
This paper describes two implementations in optical character recognition using template matching method and feature extraction method followed by support vector machine classification. With proper image preprocessing, the texts are segmented into isolated characters and the correlations between a single character and a given set of templates are computed to find the similarities and then ident...
Optical Character Recognition Systems
Optical character recognition (OCR) is the process of classifying optical patterns contained in a digital image. Character recognition is achieved through segmentation, feature extraction and classification. This chapter presents the basic ideas of OCR needed for a better understanding of the book. The chapter starts with a brief background and history of OCR systems. Then the di...
Journal
Journal title: African Journal of Information & Communication Technology
Year: 2007
ISSN: 1449-2679
DOI: 10.5130/ajict.v3i2.543